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Abstract -This paper describes a fully static high speed 16 384 bit 
read only memory (ROM), designed and fabricated by using scaled 
MOS processing and innovative circuit techniques. Specially designed 
decoder structures and sense amplifiers enable address accessing in less 
than 45 ns typically. Extensive use of small signal amplification and 0 
volt threshold devices reduce active power to a mere 70 raA, and when 
the power down option is chosen standby currents leduce to only 15 
mA typically. Together with quick turnaround (contact mask program- 
mability), and small die size (20. 3K mil 2 ), this ROM provides a low 
power, low cost alternative to bipolar PROMV 

I. INTRODUCTION 

The high speed read only memory (ROM) market has been 
traditionally dominated by the bipolar PROM 's. However, the 
majority of them suffer from the disadvantages of excessive 
power dissipation, as well as high cost due to the inherent com- 
plex bipolar technology. 

The circuit described here uses scaled metal oxide silicon 
(MOS) technology to achieve a typical 45 ns address access 
time performance, very comparable to its bipolar counter parts. 
However, it only uses 350 MW active power, while the power 
down option reduces the standby dissipation to a mere 35 MW. 
Being contact mask programmable to achieve quick turn- 
around, the die size is still at 130 mil X 156 mil, thus making 
it a very attractive, low cost replacement for the bipolar 
PROM's. A side by side comparison of the features of a bipolar 
PROM and that of this circuit is shown below. 

Bipolar PROM High speed MOS ROM 



Power supply 
Input and output 
Typical access time 
Typical active power 
Typical standby power 
Power speed product 
Die size 

Number of critical 
masking steps 



Single 5 V 
TTL compatible 
50 ns 
650 MW 
650 MW 
32.5 nJ 
40 000 mil 2 

10 



Single 5 V 

TTL compatible 

45 ns 

350 MW 

75 MW 

15.8 nJ 

20 000 mil 2 



II. CIRCUIT DESIGN 
. In order to obtain high performance, many circuit tricks are 
used. All stages play a part in achieving the goal. The follow- 
ing is a stage by stage description of how this is done. 

A. Address Buffers 

To provide for power down feature, native devices in series 
with the depletion pull ups are used. The first inverter uses 
only native pull ups which help to minimize input level sensi- 
tivity that can be caused by process variations, such as implant. 
Since the native transistor threshold is always close to 0 V, an 
enhancement native first stage will make a much more stable 
/TTL detector than previous enhancement/depletion inverters 
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Fig. 1. Decoder driver. PU denotes power up, T 0 is an enhancement 
device, Ti is a zero V T device, and T 2 is a depletion device, 

[ 1 ] , since enhancement and depletion thresholds vary indepen- 
dently of each other. It is obvious that this will make the ratio 
of the input buffers much more stable -a very important con- 
dition for the 0.8-2.0 V TTL input levels. 

Another feature that is built into the address buffer design is 
that when the power down option is activated, all address lines 
will remain at about a V cc /2 level. This is critical in achieving a 
fast chip-select accessing time when the circuit is brought back 
from standby to operating mode, since the outputs are only \ 
level from the final value. 

A push-pull output stage is used for the address line drivers, 
together with a small depletion transistor to sustain the steady- 
state voltage level. This combination will provide the fast rise 
time needed with the minimum power dissipation— also a very 
important design goal. 

A typical propagation delay through one of these address 
buffers is 9 ns, while active power dissipation is less than 1 1 
MW; the standby power dissipation, of course, will be equal to 
the leakage current as a function of the process variations and 
temperature, but is seldom more than 5-10 juA. 

B. Decoder Drivers 

This is a very important stage in terms of power-speed prod- 
uct because it possesses the highest number of power dissipating 
pull ups (127), and has one of the more heavily loaded capaci- 
tance nodes-an ^-decoder will drive a maximum of 128 tran- 
sistors. This is a key in power-speed tradeoffs. 

A specially designed decoder circuit provides the answer to 
this challenge. As can be seen in Fig. 1, this circuit uses deple- 
tion and native pass transistors as isolation devices between the 
heavily loaded nor line node A , and the driver node B. Any 
X-decoder selection allows node A to rise to only one depletion 
threshold above V SSj while node B swings a full V cc , By using 
a native push-pull driver, T 3 is not required to deliver large 
amounts of current. Therefore, decoder current is minimized, 
while high speed is still maintained. The output stage is powered 
down high to conserve power up access time. 

The typical power dissipation of an X-decoder stage is 1.5 
MW, while the propagation delay is approximately 14 ns. 

C. Current Sense Amplifier 

The sense-amplifier has always been the weak link in MOS 
ROM's. Since high speed is required, a differential current 
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Fig. 2. Sense amplifier. 
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Fig, 3. Differential amplifiers and level shifter. 



sensing technique with limited voltage swings, using a dummy 
cell as a voltage reference, has been developed. 

As shown in Fig. 2, the sense-amplifier is basically a current 
level detection circuit [2] with feedback and clamps to limit 
the voltage swing of the output to slightly larger than one 
enhancement threshold. When node A swings approximately 
100 MV, node B will swing close to 500 MV, which is the 
maximum voltage excursion allowed in this design. The oper- 
ation works in the following manner. 

1) Case 1 Node A Conducting. There is a transistor (or bit) 
in the core for the selected address. The voltage and current 
level at node A will enable T 3 to turn on, so that the voltage 
at node B ( V B ) is given by 

V B = V CC - Fj-tfi)- Ff(T 4 )- A 
where 

Vt(T\) denotes the threshold voltage of enhancement transis- 
tor T 3 + body effect due to voltage at node A, Vj(T A ) denotes 
the threshold voltage of enhancement transistor T 4 + body 
effect due to voltage at node B, and A denotes the overdrive 
voltage necessary to sustain the cell current. 



2) Case 2 Node A Not Conducting. There is no transistor 
(or bit) in the core for the selected address. Since there is no 
current path at node A, voltage bias condition will turn tran- 
sistor T 3 off, resulting in V B given by 

where V^(T{) is similarly defined as above. Thus, it can be 
shown that node B swings slightly more than one enhancement 
threshold. Since it does not require large voltage swings, which 
will add to propagation delay, the advantage of this small sig- 
nal current sense amplifier is very obvious. 

The reference bias generator, also shown in Fig. 2, is a very 
important element in this design. In order to provide a set of 
balanced inputs to the subsequent differential amplifiers (see 
Section II-C), the bias generator is used so that it will generate 
a reference level at node Z), exactly half of that voltage swing 
at node B. This is done by replacing depletion transistor T 2 in 
the sense amp with two equivalent transistors T 6 and T n in the 
bias generator, and use node D as an output. The capacitor 
C D is added to ensure that there is sufficient capacitance loaded 
at node D to filter out any cross talk noise. 
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(a) (b) 
Fig. 4. (a) Miciophotograph of die. (b) Microphotograph of output device. 





(a) (b) 
Fig. 5 . (a) Chip select access time, (b) Address access time, 



It should be pointed out that this bias generator has been 
designed such that the effect of process variations is minimized. 
For instance, T l0 and T u simulate the decoder driver, while 
T n represents the dummy core bit. All components of this 
bias generator circuit have been oriented in the same way on 
the die as their counterparts, so that in almost all aspects it 
represents an exact replica of the core to sense amplifier path. 
The propagation delay of this circuit is typically 3-5 ns. 

D. Differential Amplifiers 

As mentioned in Section U-C, the outputs of the small sig- 
nal amplifier and bias generator are fed to a series of differential 
amplifiers [3] (see Fig. 3). The first of these, stage 1, ampli- 
fies the 500 MV swing to about 1.0 V, with the high and low 
levels at about 4.0 and 3.0 V, respectively. These levels are 
then further amplified, by stage 2, to 4.5 and 2.5 V approxi- 
mately. Stage 3 is actually a level shifter, the purpose of which 
is to shift the 2.5 V low level to as close to V ss as possible. The 
outputs of stage 3 are then fed to the output buffers. 

The total propagation delay, as measured from the crossover 
point of the inputs to stage 1, to that of the output of stage 3, 
is typically 4 ns. 

III. Circuit Performance 

A microphotograph of the circuit is shown in Fig. 4. An 
oscillograph of the typical memory access time is shown in 
Fig, 5. As mentioned earlier, the typical address and chip select 
access time is less than 45 ns. The active and standby currents 
are typically at 70 and 1 5 MA, respectively. All other electrical 
parameters are at least as good, if not better, than the bipolar 
counterparts throughout the temperature and worst case 
conditions. 



IV. Summary 

The circuit described here meets the goal of achieving the 
performance of a bipolar PROM. In addition, it uses less active 
power, is more economical, and provides a power down feature 
that is not available in the bipolar circuits. With the flexible 
contact mask programming capability built into the design, it 
offers an attractive alternative to the PROM users. 
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A 20 ns, Low Power, NMOS IK X 4 Static RAM 
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Abstract -Using scaled NMOS processing and novel circuit design 
techniques to enhance the speed-power product, a high-speed, low 
power, fully static 1024-word X 4-bit random access memory has been 
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